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Persistently Exciting Tube MPC 


Bernardo Hernandez l a and Paul Trodden 1 2 


Abstract —This paper presents a new approach to deal with 
the dual problem of system identification and regulation. The 
main feature consists in breaking the control input to the 
system into a regulator part and a persistently exciting part. 
The former is used to regulate the plant using a robust 
MPC formulation, in which the latter is treated as a bounded 
additive disturbance. The identification process is executed by a 
simple recursive least squares algorithm. In order to guarantee 
sufficient excitation for the identification, an additional non- 
convex constraint is enforced over the persistently exciting part. 


I. Introduction 

The performance and optimality of a model predictive 
controller are limited by the accuracy of the model used 
to make the predictions [1], [2]. If the model is a poor 
representation of reality, the resulting control actions will 
not be optimal for the latter, causing unexpected behaviour. 
Moreover, desirable properties in model predictive control 
(MPC) formulations, such as stability and feasibility [3], [4], 
often require the computation of invariant sets, which are 
model dependent [5]. Adaptive MPC, as a way to cope with 
model uncertainty, has been receiving increasing amounts of 
attention from researchers in the last ten years, but it remains 
a largely open problem [6]. Adaptive MPC attempts to couple 
closed-loop system identification with regulation through an 
MPC controller; an inherent difficulty of such design is that 
the two objectives are incompatible. This is referred to as 
the dual control problem: while the controller tries to steady 
the system, the identifier needs to excite it [7]. 

A key challenge for adaptive MPC is how to maintain the 
stability and feasibility guarantees, particularly when hard 
constraints are considered. In [8] the model of an uncon¬ 
strained plant is updated through a modified recursive least 
squares (RLS) algorithm, while a fuzzy supervisor attempts 
to update the controller parameters based on performance 
criteria. These include a numeric evaluation of the system 
stability, but no guarantee is given. [9] uses a single value 
decomposition based algorithm to estimate, on-line, a state 
space model of the controlled system; stability is shown 
solely through numerical simulations. A set membership 
identification scheme is used in [10], coupled with additional 
output constraints in the optimization problem. These extra 
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constraints ensure boundedness of the system response and 
hence constraint satisfaction, however open-loop stability is 
assumed. Other authors have addressed the issue of stability 
by making suitable assumptions, such as in [1], [11], [12], 
where the response error produced by the uncertainty of the 
model is treated as a bounded disturbance. This allows for 
robust MPC implementations to be used under some supple¬ 
mentary assumptions, such as a known bound on the initial 
estimates error [11], [1], A novel algorithm is developed in 
[2], where two models of the plant are maintained by the 
controller; a nominal model is used to provide feasibility 
and stability guarantees, while the second, adaptive, model 
is used to improve performance. However, none of these 
approaches consider the dual control problem in an explicit 
way. In [11] the identifier chooses suitable past data for the 
computation of estimate bounds, but nothing ensures the 
existence of this data. In [1] a min-max robust approach 
accounts for future estimation but does not directly promote 
proper excitation. 

In the MPC context, this matter has been addressed in 
different ways. In [13]—[18], an additional constraint over 
the input is explicitly added to guarantee enough information 
on the output. However, feasibility of the optimization may 
be lost due to this constraint, which is in most cases non- 
convex. To overcome this, a relaxation variable is employed 
in [13], [15], yet this may result in insufficient excitation in 
favour of feasibility. The receding horizon fashion of MPC 
(i.e., that only the first part of the optimized input sequence is 
applied) is exploited in [16], where the additional constraint 
is applied exclusively over the first element of the predicted 
input sequence; this concept is also studied in [18], where 
it is shown that extending this type of constraint beyond 
one time step into the prediction horizon has a negligible 
impact on the identification results. Alternatively, a two- 
step optimization is performed in [19], [20]; the first step 
solves a standard MPC problem and the second step adds an 
exciting behaviour to the optimized sequence, while limiting 
the cost increase (reduced optimality). The concept of zone¬ 
tracking MPC is used in [21] to drive the state of the plant 
to an invariant set, inside which a persistently exciting input 
sequence can be safely applied. In recent implementations 
[18], [22], the MPC cost function is augmented with a term 
depending on the variance of the estimated parameters error, 
in an attempt to force the optimizer to choose an exciting 
input sequence, that automatically decays as the system is 
correctly identified. 

In this paper, the dual problem of regulation and system 
identification is addressed within the frame of robust MPC; 
the main feature of the present algorithm is the division of 



the input signal. The first part, called the persistently exciting 
(PE) part, aims to generate enough information for the identi¬ 
fication process, while the second (regulator) part is designed 
following the main objective of regulating the plant to the 
desired steady state. From the control perspective, the PE part 
of the input may be treated as a bounded disturbance, hence 
a standard tube MPC formulation [23] is a suitable selection 
for the regulation task. This allows to maintain the classic 
form of the optimization problem, unlike [ 18]—[20], [22], and 
also helps to provide a guarantee of asymptotic stability. A 
standard RLS algorithm [24] with forgetting factor is used 
for the identification process. To achieve convergence of the 
estimates, an additional constraint based on the persistence 
of excitation theory [7] is included in the optimization. This 
ensures not only an accurate estimation but also that the PE 
input is automatically defined by the optimizer (contrary to 
the approach in [21], where it must be computed off-line). 
The PE constraint proposed in [16] is tightened so as to 
guarantee recursive feasibility while allowing for increased 
optimality. The disturbances considered at this stage arise 
from model mismatch and the excitation part of the input, 
thus the approach is inherently deterministic. 

The paper is organized as follows: Section II defines the 
problem and required preliminaries. Section III describes the 
proposed persistently exciting tube MPC (PE-Tube MPC). 
Stability and feasibility are established in Section IV. Section 
V contains numerical simulation results. 

Notation: The operators ® and 0 denote the Minkowski 
sum and the Pontryagin difference (respectively), defined as 
in [25]. The set No is the set of all the positive integers 
including 0. The zero vector and the identity matrix in K" 
are represented respectively by 0 n and I n . 

II. Problem statement and preliminaries 

The problem is to regulate a linear time invariant system, 
subject to input and state constraints, for which only a 
nominal discrete time state space model is available. Define, 


(A,b) 

Nominal model. 

(la) 

(A, B) 

Real system. 

(lb) 

(A(*),b(*)) 

Estimated model at time i. 

(lc) 

The nominal state space model takes the following form. 

x(i +1) = 

Ax(i) + Bu(i) 

(2a) 

x(i) GlCl", 

u(i ) G»cK m , Vi G N 0 

(2b) 

in which x(i) and u(i) 

are the state and input vectors at time 


i. The following common assumptions are supposed to hold. 

Assumption 1 (Stabilizability). The pairs ( A,B ), (A. /i) 
and (^A(i), B(i)^ are stabilizable Vi G No- 

Assumption 2 (Properties of constraint sets). The set X is 
closed and the set U is compact. Both contain the origin. 


A. Standard MPC formulation 

The standard MPC optimization problem for the system 
in (2) with prediction horizon N at time i is Pat (a; = x(i)): 

N -1 

min V'' (x T (k)Qx(k) + u T (k)Ru(k)) +V/(x(iV)) (3) 

U Z ^ y 


subject to: 

x(0) = x (4a) 

x(k + 1) = Ax(k) + Bu(k) (4b) 

x{k) G X, u{k ) G U, k = 0,1,..., N — 1 (4c) 

x(N) gX/CX (4d) 

where (Q,i?) are the state and input weight matrices. Once 
the optimization is solved, the first part of the optimal input 
sequence is applied to the plant, a new state measurement is 
taken, and the process is repeated. 

It is well known that, under Assumptions 1 and 2, an ap¬ 
propriate selection of the weight matrices, terminal cost Vf (•) 
and terminal constraint X f provides closed-loop asymptotic 
stability of the origin [25] for the nominal system. In 
particular, we use the following standard assumption: 

Assumption 3 (Stability assumption). Vf : X/ -* R> 0 is 
continuous and Vf(O n ) = 0 . Q is positive semidefinite and 
R is positive definite. The set X/ is a closed control invariant 
set for (4b), contains the origin in its interior, and, 

3ugU s.t. Vf (Ax + Bu ) + £(x, u) < V/(x), Vx G X/ 

If the initial state is inside the feasibility region defined by 
constraints (4c) and (4d) (called from now on A)v), recursive 
feasibility of the optimization can be guaranteed [25]. 

B. Persistence of excitation 

For many reasons, the nominal model (A, B) may not 
be an accurate representation of the real system ( A,B ). 
This could have a detrimental effect on the performance 
and stability of the MPC controlled system; therefore, to 
reduced model uncertainty, some form of closed-loop system 
identification should be implemented. Note that a state space 
model may be regarded as a system of ARX( 1,1) models, 
hence a predictor can be built for each component of the 
state vector. 


Xj{i) 

= </> T (i- 1)9j(i), 

3 = 1,2,.. 

., n (5a) 

f T (i) 

= [z T W u T (i)] 


(5b) 

0j(i) 

=uaa 4-«r. 

7 = 1,2,.. 

•, n (5c) 


In (5), Xj(i ) represents the prediction of the state component 
j, at time i, is the regressor vector and ^Aj(i), Bj(i)^J 
are the j lh rows of the currently estimated matrices (lc). A 
standard RLS algorithm with constant forgetting factor A [24] 



is employed to identify a new model every time step. The 
recursion at time i is computed as follows, 

Oj(i) =9j(i - !) + (*)<£(*) [*j(*) 

-<j> r (i-l)9j(i-l)], j = 1,2,... ,n (6a) 
Rm(i) =XRm(i - 1) + </>(i)</> T (f), j = 1,2,...,n (6b) 

Convergence of the estimated parameters 0, is guaranteed if 
the regressor is a strongly persistently exciting sequence [7], 


Definition 1 (Strongly persistently exciting sequence). The 
sequence {( j>(i )} = (f)(1), ■ ■ ■ is said to be 

strongly persistently exciting of order N p at time i, if there 
exists an integer l p and real numbers po, pi >0 such that. 


(, p -L 

PlI(n-\-m)N p ^ —j'j ^ PoI{n+m)Np 


3=0 


4>i-i 


- j ) 

(t>{i - j - 1) 

</>(i — j — Np + 1) 


The variable 7V p defines the length of a time window 
that is going to be observed and the variable l. p defines the 
number of time instants into the past that this window will be 
observed. Definition 1 is identical to definition 3.4.A given in 
[7] but after a time shift, which objective is to set the current 
time i as the upper time limit (i.e. the window is placed 
at time i and it moves backwards). In this way, coupling 
with the receding horizon fashion of MPC is achieved in a 
straightforward way. 

Persistence of excitation of the regressor vector is not 
a suitable condition to use as a constraint in the MPC 
context, mainly because the state vector is not an explicit 
decision variable of the optimization (3)-(4). Within the 
MPC framework, it is more convenient to focus on the 
input, and how the persistence of excitation propagates to 
the regressor. To do this, the concept of state reachability is 
employed. 


Definition 2 (State reachability). System (2) is said to be 
state reachable if, for any x £ X, there exists an input 
sequence {u(j) £ U}j = o:s<oo such that at time s, x(s ) = x. 


Proof. This proof can be found in [26] (Theorem 2.1). ■ 

In [16] it is shown that the lower bound of (7a) charac¬ 
terizes the outside of an ellipsoid, hence the PE constraint 
is non-convex. Also note that = [O m (jv ,-i)] ( or any 
fixed value) violates (7a), therefore regulation to a steady 
state and persistence of excitation cannot be simultaneously 
attained. 

III. Tube MPC with persistence of excitation 

The main contribution of this paper is presented in this 
section, the underlying idea is to include a persistence of 
excitation constraint in a standard MPC formulation. This is 
done from a robust control perspective, where the excitation 
is regarded as a bounded disturbance. The proposed approach 
uses tube MPC, which is a robust control technique with 
guaranteed stability under bounded additive uncertainties, but 
complexity similar to conventional MPC [23], [25]. 

A. Tube MPC for uncertain systems with partitioned input 

Tube MPC solves the regulation problem for an undis¬ 
turbed nominal model, while securing that the state of the 
uncertain system will always be in a robust positive invariant 
(RPI) set [5], centred around the nominal system trajectory. 
This robust control technique is inherently capable of dealing 
with model uncertainties as long as these can be quantified, 
i.e., treated as a bounded additive disturbance. This requires 
a certain insight on how different the real system (la) and 
prediction model (lc) may be. To account for this, the 
following assumption is supposed to hold, 

Assumption 4 (Size of parametric uncertainty). A set Ws := 

{ws = {A — A{i))x+ ( B — B(i))u \ (x,u) £ X x U, Vi £ 

No} is known. 

Consider the model structure of (2). Henceforth, the input 
will be divided into a regulator part, u, and a persistently 
exciting part, w. The nominal model (2) is rewritten as, 

x(i + 1) = Ax(i) + B ( u(i ) + w(i)) (8a) 

x(i) £ X, u{i) £ U, w(i) eW, Vi £ No (8b) 
Bw(i) = w(i) eW = BW, Vi £ No (8c) 


Theorem 1 (Persistence of excitation of reachable systems). 
The sequence {(f>(i)} = <j>(0),<j>(l),-• • ,<j>(i), with <f>(-) 
defined as in (5b), is said to be strongly persistently exciting 
of order N p at time i if, the system (5a) is state reachable 
and there exists an integer l p and real numbers po,pi > 0 
such that. 


lp— 1 

PlImNp ^ ^ ^ ) '■ > PO-^mNp 

3=0 


u{i - j ) 
u(i-j - 1) 


u(i - j - N p + 1) 


(7a) 


(7b) 


Assumption 5 (Properties of the divided input constraint 
sets). The sets W and U are compact and contain the origin. 
Also U C U © W and is non-empty. 

Assumption 5 implies U©W C U. Note that W is a linear 
mapping of W therefore it maintains compactness [27]. 

The tube MPC approach treats w as a bounded additive 
disturbance. The undisturbed model takes the form, 

z(i + 1) = Az{i) + Bv(i) (9a) 

z(i)eZ = X0§, v{i) £ V = TLJ © K t S, Mi £ No (9b) 

in which K t is any stabilizing gain for (A, B), guaranteed 
to exist in view of Assumption 1. The set S is an RPI set 



for the dynamics of the error between the trajectories of the 
nominal and uncertain models, 

e(i) = x(i) — z(i), e(0) = 0 (10a) 

e[i + 1) = {A + BKf)e(i) + w(i) + ws(i) (10b) 

w(i) e W, ws(i) £ W s, Vi £ No (10c) 

Note that the undisturbed system evolves independently after 
i = 0. For a nominal solution to exist, the constraint space 
defined by constraints (9b) must be non-empty. 

Assumption 6 (Allowable disturbance size for constraint 
satisfaction). The set § is such that § C X and K t E> C U. 


Assumption 6 is not uncommon in robust control imple¬ 
mentations, it simply states the fact that it will not be possible 
to satisfy the constraints if the disturbances are too large. 
After a standard MPC problem (section II-A) is solved for 
the nominal system (9a) under tightened constraints (9b), the 
input to the uncertain system is computed from the following 
control policy, 

u{i) = v(i) + K t (x(i) - z(i)) (11) 


B. Additional PE constraint 


Since w is bounded (8b), the upper bound in Theorem 
1 is trivially fulfilled [15], therefore the focus is placed on 
achieving the lower bound. At time i define. 


lp— 1 


M (' w(i )) = Mj = ^ (wi-jwj_j) - poImN p (12a) 

3=0 

w(i- j) 
w(i-j - 1) 


J t~3 


j(i -j-N p + l) 


(12b) 


That Mi depends only on the past and current exciting 
input makes the following a suitable persistence of excitation 
constraint within the receding horizon fashion of MPC 


Mi > 0 


(13) 


Remark 1. The realization of constraint (13), i.e. a PE 
behaviour of w, does not necessarily imply persistence of ex¬ 
citation in the absolute input u. This is due to the disturbance 
rejection control policy (11). Numerical simulations have 
shown that a proper selection of the linear gain I\ t secures 
transmission of the PE condition towards u. The investigation 
of conditions for gain selection is beyond the scope of this 
paper, hence we require the following assumption. 

Assumption 7 (Persistence of excitation transmission). For 
the linear gain K t , the persistence of excitation of w is 
transmitted to the absolute input sequence u. 

C. Tube MPC with additional PE constraint 

The optimization problem of the proposed model predic¬ 
tive controller, at time i is, Pjv(.z = z(i)): 

N -1 

min E (z T (k)Qz(k) + d T {k)Hd(k)) + Vf(z{N)) (14) 

k -0 


subject to: 


2 (0) 

= 

z, 

z(k + 1) = Az{k) + Bv(k) 

(15a) 

z(k) 

£ 

Z, 

v(k) £ V, k = 0,..., N — 1 

(15b) 

z(N) 

£ 

Z/ 

c z 

(15c) 

w( 0) 

£ 

w 


(15d) 

w(k) 

= 

0m 

, k = l,...,N -1 

(15e) 

M 

(w(i) 

= u?(0)) > 0 

(15f) 


where Z f is the nominal equivalent X/. The cost term 
d T (k)lZd(k) is a straight forward augmentation of the usual 
input cost with, 


d T (fc) = [v T (k) w) T (/c)], k = 0,1,..., N — 1 


n = 


R 


J m X m 

R 


A = A(i), B = B{i) 


(16a) 

(16b) 

(16c) 


IV. Stability and feasibility 

In this section stability and recursive feasibility proofs are 
derived for the PE-Tube MPC. 


A. Stability 

The objective of performing closed-loop system identifica¬ 
tion is to reduce model uncertainty and thereby improve the 
performance of the MPC. Therefore it is safe to assume that 
the initially known model (la) will differ from the actual 
plant being controlled, and from any transitional model (lc) 
given by the recursive identification (6). This represents a 
considerable drawback, as one of the main requirements of 
tube MPC is the computation of an RPI set, which is model 
dependent. Different approaches can be used to compute 
such sets; in [21] for example, Assumption 4 is bypassed 
by showing that, under a type of parametric affine model 
uncertainty, an RPI set computed for a certain model is also 
RPI for a family of models. The definition of a specific set of 
rules for computing § is out of the scope of this paper; for the 
examples shown in Section V a suitable RPI set is computed 
on the basis of Assumption 4 and the error dynamics (10). 

Theorem 2 (Stability of the PE-Tube MPC). If assumptions 
1-6 hold, then the set A := § x {(),,} is asymptotically stable 
with a region of attraction (iJjv©§) xZm for the constrained 
composite system, 

x(i + 1 ) = Ax(i) + Bu(i ) + w(i) + ws(i) 

z(i + 1 ) = Az(i) + Bv(i) 

under the closed-loop control laws defined by (11) and (14)- 
(15) respectively (Zn is the nominal equivalent of ,T.y). 

Proof. Follows directly from the stability proofs in [25]. ■ 

B. Recursive feasibility 

Recursive feasibility of a standard tube MPC formulation 
is provided by initializing inside Z ; \-Q)S. In [16] an additional 
assumption is used to provide a proof of recursive feasibility 
under the effects of the non-convex PE constraint (15f), 



Assumption 8. A feasible solution is available at time i — 1, 
i.e., Mj_i > 0 

Theorem 3 (Recursive feasibility: trivial solution). If As¬ 
sumption 8 holds, then there exists a feasible solution at 
time i for the persistently exciting tube MPC (14)—(15). 

Proof. The proof given in [16] is repeated here for clarifying 
purposes. From (12), 


[ 21 ]): 

A(5) = A + SA 
B(S) = B + SB 


0.42 -0.28 
0.02 0.60 


-0.6 -0.4 
-0.6 -0.85 


(19a) 


0.30 

-0.40 


+ <5 


- 0.2 

0.4 


(19b) 


subject to the following constraints. 


Mi = Mj_i + WiwJ - Wi- tp wJ_ lp 
w l = WiwJ - Wi-l p wJ_ lp 

Clearly then, w l > 0 =>■ M, > 0. The proof is completed 
by noticing that w(i) = w(i — l p ) =>• w l = 0. ■ 

Theorem 3 provides recursive feasibility under the trivial 
periodic repetition of a previous solution, but it does not 
analyze the effect of choosing a different one. In fact, w(i) 
is a decision variable in the proposed optimization problem 
(14), hence the optimizer is free to choose w(i) ^ w(i — 
lp) as long as constraints (15d) and (15f) are not violated. 
Numerical simulations (conducted on the same system used 
as an example in [16]) show that recursive feasibility may 
be lost if periodicity is broken, namely, if the optimization 
algorithm lands in a solution such that w(i) ^ w(i — l p ) . 
The observed behaviour can be summarized in, 

3 w(i) e W s.t., w(i) w(i — lp) 

A Mi > 0, but M i+ 1 < 0 Vru(i + 1) € W 

According to (12). the non-trivial optimized w(i) remains 
in Wi for N p — 1 time steps. To take this into account the 
following constraints are proposed to replace (15e)-(15f), 

w(k) = w(i + k — Ip), k = 1,..., N p — 1 (18a) 

w(k) = 0 TO , k = N p , — 1 (18b) 

M (w(i + k) = w(k)) > 0, A; = 0,1,..., A/p —1 (18c) 


X = {x £ R 2 s.t, \xj\ < 17, j = 1,2} (20a) 

D={«6l s.t, |u| < 4} (20b) 

W = {w £ R s.t, |w| < 0.2} (20c) 

\S\ < 0.15 (20d) 


Since no particular performance requirements are being 
considered, the controller parameters are loosely set to N = 
3, Q = / 2 x 2 and R = 1. The terminal cost Vf, and 
the terminal constraint set If are computed according to 
Assumption 3. A(5) is inherently stable for any 6 following 
(20d), this provides flexibility in choosing the linear gain 
K t = [—0.112 0.354] which is stabilizing and complies 
with Assumption 7. A set W s following Assumption 6 is 
defined for (19)-(20) and the corresponding RPI set § is 
computed. 

Following the directions given in [7], the PE constraint 
parameters are set to N p = 6 and l p = 11; given the size of 
W a value of po = 0.05 is employed. To guarantee recursive 
feasibility, a feasible PE sequence of length N p + l p — 1 has 
to be provided (Assumption 8). However, this sequence only 
acts as a buffer for feasibility purposes, and the actual value 
feed to the system is an optimization variable. 

For the RLS algorithm a forgetting factor A = 0.97 is 
employed. The estimates vector 9j is initialized at the known 
values (A,B). The information matrix /)j D is initialized 
as the null matrix, therefore a pseudo- inverse is computed 
for (5a) until Rio becomes invertible. Albeit a recursion is 
computed at every time instant, the prediction model update 
(16c) is performed only every 3 time steps. 


Theorem 4 (Recursive feasibility: non-trivial solution). If 
Assumption 8 holds, and the constraint (15f) is replaced 
by (18), then there exists a feasible, not necessarily trivial, 
solution at time i for the PE-tube MPC ( 14)— (15). 

Proof. This result is established by extending the proof for 
Theorem 3 to cover N p — 1 time steps. ■ 

Remark 2. Each time instant the optimization updates the 
last element of the initially supplied PE sequence. There¬ 
fore, periodicity arises naturally in the optimized exciting 
sequence. 

V. Simulation results 

This section shows the behaviour of the proposed al¬ 
gorithm through two numerical examples. The task is to 
perform closed-loop system identification while regulating 
the states of the following multi-variable system (taken from 


A. Closed-loop identification capabilities 

An initial state a;(0) = [0 0] T € Z jy is considered 
to assess the closed-loop identification capabilities of the 
proposed algorithm. This is done to avoid the additional 
information that would be generated in the process of 
regulation, so that the effect of the PE constraint can be 
observed independently. Fig. 1 shows the optimized input 
signal generated by the PE-Tube MPC for both, nominal 
system (w) and true plant ( u). As expected, given the initial 
state, the nominal input remains at the origin while the input 
for the uncertain system is, indeed, disturbed by the PE part 
w. Fig. 2 shows the trajectory of the states X\ and Z\. During 
the initial time steps, the algorithm optimizes a PE sequence 
on the basis of the feasible sequence initially supplied, hence 
the transient behaviour observed on the uncertain system 
( x ). Feasibility is maintained during this period thanks to 
the tighter constraint (18). An optimized periodic solution is 
attained fairly fast. 





Fig. 1: Optimized input signal for the nominal system and 
true plant, x(0) = [0 0] T . 



Time instant i 


Fig. 2: First state trajectory for the nominal system and true 
plant, x(0) = [0 0] T . 


of the proposed algorithm, the nominal state shows an 
asymptotic behaviour towards the origin. Due to the periodic 
PE disturbance ( w ), the state of the true plant is ultimately 
bounded to lie inside the set {0„} © §. 

VI. Conclusions and future work 

In this paper, a new way to approach the dual problem of 
system identification and regulation via a model predictive 
controller has been presented. At each time instant, the input 
used to control the system is divided into a persistently 
exciting (PE) part and a regulator part. The PE part is 
treated as a bounded disturbance and a tube MPC, enhanced 
with a PE constraint, is used to regulate the plant. At the 
same time, thanks to the PE constraint, enough information 
is generated for convergence of the estimated parameters. 
Under the proper assumptions, the PE-Tube MPC has proved 
robust stability and recursive feasibility. Future work will be 
focused on the analysis of the transmission of persistence 
of excitation, the implementation of on-line variation of 
the allowable size of perturbations with the objective of 
feasibility enlargement and the shifting of the proposed 
approach to an stochastic framework. 


TABLE I: Evolution of estimated parameters error [%]. 


Parameter 

i = 0 


i 

= 3 



i 

= 6 



i 

= 9 

An 

-17.6 

-1.76 

X 

10 1 

-2.83 

X 

10" 

-li 

-2.24 

X 

10" 

-11 

Al2 

-17.6 

-1.76 

X 

10 1 

1.25 

X 

10" 

-11 

9.48 

X 
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Fig. 3: State trajectory for the nominal system and true plant. 

Table I shows the error of the identified value (w.r.t. true 
plant) for all the system parameters, at several time instants. 
Thanks to the PE effect of w, and in view of the deterministic 
framework, the RLS algorithm has enough information to 
achieve perfect parameter estimation. 

B. Regulation capabilities 

The regulation capabilities of the PE-Tube MPC are eval¬ 
uated by initializing the scheme at x(0) = [8 8] T € -Zjv; Fig. 
3 shows the state trajectory for both, nominal system and 
true plant. As expected, given the stabilizing characteristics 
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